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Abstract — Multiple-input  multiple-output  (MIMO)  techniques 
have  been  actively  pursued  in  underwater  acoustic  communica¬ 
tions  recently  to  increase  the  data  rate  over  the  bandwidth-limited 
channels.  In  this  paper,  we  present  a  MIMO  system  design,  where 
spatial  multiplexing  is  applied  with  OFDM  signals.  The  proposed 
receiver  works  on  a  block-by-block  basis,  where  null  subcarriers 
are  used  for  Doppler  compensation,  pilot  subcarriers  are  used 
for  channel  estimation,  and  a  MIMO  detector  consisting  of  a 
hybrid  use  of  successive  interference  cancellation  and  soft  MMSE 
equalization  is  coupled  with  LDPC  channel  decoding  for  iterative 
detection  on  each  subcarrier.  The  proposed  design  has  been  tested 
using  data  recorded  from  three  different  experiments.  A  spectral 
efficiency  of  3.5  bits/sec/Hz  was  approached  in  one  experiment, 
while  a  data  rate  of  125.7  kb/s  over  a  bandwidth  of  62.5  kHz  was 
achieved  in  another.  These  results  suggest  that  MIMO-OFDM 
is  an  appealing  solution  for  high  data  rate  transmissions  over 
underwater  acoustic  channels. 


I.  Introduction 

To  enhance  the  transmission  rate  over  communication  links, 
either  the  bandwidth,  or  the  spectral  efficiency  in  the  unit 
of  bits/sec/Hz,  or  both,  need  to  be  increased.  Multi-input 
multi-output  (MIMO)  techniques  can  drastically  increase  the 
spectral  efficiency  via  parallel  transmissions  over  multiple 
transmitters  [3],  [4],  hence  are  attractive  to  underwater  acoustic 
communications  which  are  inherently  bandwidth-limited. 

Recently,  several  different  approaches  have  been  investi¬ 
gated  for  MIMO  underwater  acoustic  communications,  in¬ 
cluding  those  for  single  carrier  transmissions  [5]— [  11]  and 
those  for  multicarrier  transmissions  in  the  form  of  orthogonal- 
frequency-division-multiplexing  (OFDM)  [1],  [2],  [12],  [13]. 
Specifically,  adaptive  multichannel  decision-feedback  equal¬ 
ization  (DFE)  has  been  used  in  [5],  [6]  while  a  time-reversal 
preprocessing  followed  by  a  single  channel  equalizer  has  been 
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used  in  [7],  [8],  In  [5],  [6]  and  [7],  [8],  parameter  adaptation 
is  performed  on  a  symbol  by  symbol  basis.  Adaptive  block 
equalization  techniques  have  been  proposed  in  time  domain  [9] 
and  in  frequency  domain  [10],  where  parameter  adaptation  is 
carried  over  successive  blocks.  Using  basis  expansion  models 
(BEM)  to  parameterize  underwater  acoustic  channels,  block- 
differential  space  time  coding  has  been  investigated  in  [11]. 
For  multicarrier  systems,  a  non-adaptive  block-by-block  de¬ 
sign  was  presented  in  [1],  [2],  which  is  built  upon  the  receiver 
developed  for  single-transmitter  OFDM  in  [14],  while  a  block- 
adaptive  approach  was  developed  in  [12],  which  is  built  upon 
the  single-transmitter  OFDM  system  in  [15],  [16].  In  [13], 
experimental  results  were  presented  for  both  coherent  and 
differential  designs  in  an  OFDM  system  with  two  transmitters. 

The  objective  of  this  paper  is  to  present  a  MIMO-OFDM 
system  design  [1],  [2]  and  report  on  the  performance  results 
with  data  recorded  from  various  experiments.  The  proposed 
MIMO-OFDM  design  consists  of  the  following  key  compo¬ 
nents:  (1)  null  subcarriers  are  inserted  at  the  transmitter  to 
facilitate  the  compensation  of  Doppler  shifts  at  the  receiver; 
(2)  pilot  tones  are  used  for  MIMO  channel  estimation,  and  (3) 
an  iterative  receiver  structure  is  adopted  that  couples  MIMO 
detection  with  channel  decoding.  The  MIMO  detector  applied 
on  each  OFDM  subcarrier  consists  of  a  hybrid  successive 
interference  canceller  and  MMSE  equalizer  with  a  priori 
information  [17],  while  the  codes  used  are  the  nonbinary 
low-density  parity-check  (LDPC)  codes  from  [18].  Note  that 
an  iterative  receiver  has  been  investigated  in  [19]  for  an 
underwater  OFDM  system  with  one  transmitter,  where  carrier 
synchronization,  channel  estimation,  and  channel  decoding  are 
coupled.  Our  receiver  does  not  include  carrier  synchronization 
and  channel  estimation  in  the  loop.  It  rather  focuses  on  the 
iterative  processing  between  the  MIMO  detection  and  channel 
decoding. 

The  proposed  design  has  been  tested  using  data  recorded 
from  three  different  experiments:  i)  the  AUV  Fest,  Panama 
City,  FL,  June  2007,  ii)  the  RACE  experiment  in  the  Nar- 
ragansett  Bay,  Rhode  Island,  March  2008,  and  iii)  the  VHF 
experiment  conducted  at  the  Buzzards  Bay,  MA,  April  2008. 
For  convenience,  we  will  term  these  experiments  as  AUV07, 
RACE08,  and  VHF08  hereafter.  With  QPSK  modulation, 
rate  1/2  coding,  and  a  12  kHz  bandwidth,  the  achieved 
data  rate  in  AUV07  was  12.18  kb/s.  For  the  RACE08 
experiment,  we  report  MIMO-OFDM  performance  results 
of  QPSK/8-QAM/16-QAM/64-QAM  with  two  transmitters. 
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QPSK/8-QAM/16-QAM  with  three  transmitters,  and  QPSK/8- 
QAM  with  four  transmitters  where  a  bandwidth  of  4.9  kHz  is 
used.  A  spectral  efficiency  of  3.5  bits/sec/Hz  was  approached 
with  various  configurations.  In  the  VHF08  experiment,  a  data 
rate  of  125.7  kb/s  was  achieved  with  two  transmitters,  16- 
QAM  modulation,  rate  1/2  coding,  and  a  bandwidth  of  62.5 
kHz.  These  results  suggest  that  MIMO-OFDM  is  an  appealing 
solution  for  very  high  data  rate  transmissions  over  underwater 
acoustic  channels. 

The  rest  of  the  paper  is  organized  as  follows.  We  describe 
the  transmitter  design  in  Section  II  and  present  the  receiver 
algorithms  in  Section  III.  Performance  results  for  different 
experiments  are  summarized  in  Sections  IV,  V,  and  VI. 
Conclusions  are  drawn  in  Section  VII. 

Notation:  Bold  upper  and  lower  letters  denote  matrices 
and  column  vectors,  respectively;  (-)T,  (•)*,  and  (-)n  denote 
transpose,  conjugate,  and  Hermitian  transpose,  respectively; 
Iat  is  the  N  x  N  identity  matrix. 

II.  Transmitter  Design 

We  consider  MIMO-OFDM  transmission  with  spatial  mul¬ 
tiplexing  on  Nt  transmitters.  The  basic  signalling  format  is 
zero-padded  (ZP)  OFDM  [14],  Specifically,  let  B  denote  the 
bandwidth  and  K  the  number  of  subcarriers.  The  subcarrier 
spacing  is  A /  =  B/K  and  the  OFDM  block  duration  is 
T  =  1/A /  =  K/B.  Each  OFDM  block  is  followed  by  a 
guard  time  of  duration  Tg  to  avoid  interblock  interference.  Out 
of  the  total  K  subcarriers,  Kn  subcarriers  are  null  subcarriers 
where  no  information  will  be  transmitted,  Kp  subcarriers 
are  pilot  subcarriers  carrying  known  symbols,  and  the  rest 
Kd  =  K  —  K n  —  Kp  subcarriers  are  data  subcarriers. 

The  signals  are  generated  as  follows.  Let  rc  denote  the 
code  rate  of  channel  code  and  M  the  size  of  the  constellation, 
which  could  be  4,  8,  16,  or  64  in  our  experiments.  For  each 
OFDM  block,  we  generate  Nt  independent  bit  streams  each 
of  length  rcKd  log2  M  and  encode  them  separately  using  the 
nonbinary  low-density-parity-check  (LDPC)  codes  [18].  Each 
coded  bit  stream  of  length  Kd  log2  M  is  mapped  into  a  symbol 
sequence  of  length  Kd-  A  total  of  Nt  OFDM  blocks  are 
formed  with  each  block  carrying  Kr/  symbols  from  one  symbol 
sequence.  After  proper  pilot  insertions,  the  Nt  OFDM  blocks 
are  transmitted  from  Nt  transmitters  simultaneously.  The  next 
Nt  blocks  then  follow. 

Accounting  for  all  the  overheads  due  to  the  guard  interval, 
channel  coding,  pilot,  and  null  subcarriers,  the  overall  spectral 
efficiency  in  terms  of  bits  per  second  per  Hz  (bits/s/Hz)  is: 

a  =  Nt-  T  +  T  •  ^  •  rc  •  log2  M.  (1) 

With  a  bandwidth  B ,  the  data  rate  in  the  unit  of  bits/s  is 

R  =  aB.  (2) 

In  this  paper,  we  will  include  experimental  results  with 
Nt  =  2, 3, 4  transmitters.  The  total  number  of  pilot  subcarriers 
for  all  transmitters  is  K.p.  Specifically,  we  will  use  a  total  of 
Kp  =  K / 4  pilot  subcarriers  in  our  experimental  results.  To 
simplify  channel  estimation,  each  transmitter  will  be  allocated 
a  set  of  non-overlapping  pilot  subcarriers  to  transmit  nonzero 


pilot  symbols,  while  zeros  are  transmitted  on  those  subcarriers 
that  belong  to  other  transmitters.  When  Nt  =  2  and  Nt  =  4, 
each  transmitter  p  will  be  assigned  Kp/Nt  =  K/(4Nt ) 
subcarriers  that  are  equally  spaced.  For  example,  one  as¬ 
signment  on  the  pilot  indexes  to  the  pth  transmitter,  where 
p  =  l,...,Nt,  could  be  {47Vt(*  -  1)  +  4{p  -  1)  +  2}^4Nt) . 
When  Nt  =  3,  the  pilot  positions  are  identical  to  the  Nt  =  4 
case,  simply  turning  off  one  transmitter  from  the  4-transmitter 
system.  The  null  subcarrier  positions  are  identical  for  all 
transmitters.  Half  of  null  tones  are  placed  at  the  edges  of 
the  frequency  band,  while  the  other  half  are  randomly  drawn 
from  the  available  subcarrier  positions.  The  positions  of  null 
subcarriers  are  fixed  for  all  blocks  after  being  picked  during 
the  design  phase. 

III.  Receiver  Algorithms 

The  receiver  algorithms  should  be  well  designed  for  the 
underwater  acoustic  communications.  For  stationary  MIMO- 
OFDM  tests,  no  resampling  operation  as  described  in  [14]  was 
needed.  The  key  processing  steps  at  the  receiver  are  depicted 
in  Fig.  1,  and  will  be  described  next. 

A.  Doppler  estimation 

The  channel  Doppler  effect  can  be  viewed  as  caused  by 
carrier  frequency  offsets  (CFO)  among  the  transmitters  and  the 
receivers  [14].  On  each  receiver,  we  assume  a  common  CFO 
relative  to  all  transmitters,  as  in  [20,  Chapter  11.5],  Hence, 
the  CFO  estimation  algorithm  presented  in  [14,  Eqns.  (14) 
and  (15)]  is  directly  applicable,  where  the  energy  on  the  null 
subcarriers  is  used  as  the  objective  function  to  search  for  the 
best  CFO  estimate. 

After  Doppler  shift  estimation  and  compensation,  the  av¬ 
erage  energy  on  the  null  subcarriers  is  used  to  compute 
the  variance  of  the  additive  noise  and  residual  inter-carrier- 
interference  (ICI).  This  quantity  is  needed  for  the  soft  MMSE 
equalization  in  Section  III-C. 

B.  Channel  estimation 

After  CFO  compensation,  pilot  tones  are  used  for  channel 
estimation.  Note  that  at  each  receive  antenna  a,  a  total  of  Nt 
channels  {h^  :=  [/^(O), . . . ,  hv^{L  -  1)]T}^41  need  to 
be  estimated,  where  L  is  the  channel  length  and  h,/g!(l)  is  the 
Ith  channel  tap  of  the  baseband  equivalent  model. 

Since  each  transmitter  is  assigned  with  an  exclusive  set  of 
pilot  subcarriers,  channel  estimation  is  carried  out  for  each 
transmitter-receiver  pair  separately.  With  equally-spaced  pilot 
tones,  the  least-square  (LS)  channel  estimator  does  not  involve 
matrix  inversion  and  can  be  implemented  by  an  Kp/Nt- 
point  inverse  fast  Fourier  transform  (FFT),  as  described  in 
[14,  eqns.  (18)  and  (19)].  Once  the  channel  estimates 
p  =  1  ,...,Nt,  v  =  1  ,...,Nr,  are  available,  the  channel 
frequency  response  on  each  data  subcarrier  k  is  evaluated  as 

L- 1 

HuAQ  =  J2  KAl)e~j2*kl/K-  (3) 

;=o 
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Fig.  1.  The  receiver  block  diagram 


Since  Kp/Nt  pilot  tones  are  used  for  each  channel  estimator, 
our  transceiver  design  can  handle  channels  with  L  <  Kp/Nt 
taps,  which  corresponds  to  a  delay  spread  of  Kp/Nt/B 
seconds.  To  handle  longer  channels,  sparse  channel  estimation 
based  on  irregularly  spaced  pilot  tones  can  be  pursued  (see 
e.g.,  [21]),  which  is  outside  the  scope  of  this  paper. 

C.  Iterative  MIMO  demodulation  and  decoding 

On  each  data  subcarrier  k,  the  data  from  Nr  receiving- 
elements  is  grouped  into  a  vector  y[fc]  =  [t/i[fc], . . . ,  yNr[k])T  ■ 
The  vector  s[fc]  :=  [[si[fc],  ■  •  ■ ,  Sivt[fc]]T  contains  the  transmit¬ 
ted  symbols  on  the  fcth  subcarrier  from  Nt  transmitters,  and 
H[fc]  denotes  the  Nr  x  Nt  channel  matrix  with  HVIJ\k\  as  its 
(v,  /x)th  entry.  We  thus  have: 

y[k]  =  H[/c]s[A;]  +  w[fe],  (4) 

where  w[k]  contains  the  additive  noise  and  residual  ICI.  We 
assume  that  the  noises  on  different  receivers  are  uncorrelated 
and  Gaussian  distributed,  where  the  noise  variance  is  estimated 
as  the  average  energy  on  the  null  subcarriers.  For  convenience 
of  algorithm  presentation,  the  data  are  properly  scaled  so  that 
the  noise  variances  are  identical  for  all  receivers.  In  other 
words,  w[fc]  is  assumed  to  be  additive  white  Gaussian  noise. 

A  maximum  a  posteriori  (MAP)  MIMO  detector  and  a 
linear  zero-forcing  (ZF)  detector  were  presented  in  [1]  for 
the  setting  of  two  transmitters  and  QPSK  modulation.  In  this 
paper,  we  combine  successive  interference  cancellation  (SIC), 
the  soft  MMSE  equalization  method  developed  in  [17],  and 
the  nonbinary  LDPC  decoding  in  [18]  to  develop  an  iterative 
procedure  on  MIMO  demodulation  and  decoding.  The  steps 
are  as  follows. 

•  Step  1:  Initialization. 

First,  we  define  Nt  flags  to  indicate  the  decoding  success 
on  parallel  data  streams.  Initially  all  flags  are  set  to  zero 
implying  no  success. 

Second,  for  each  symbol  in  (4)  to  be  demodulated,  the 
mean  is  set  to  zero  and  the  variance  to  the  symbol  energy 
Es,  i.e.,  initially  each  symbol  has  equal  probability  of 
residing  on  all  the  constellation  points. 

Third,  to  reduce  the  complexity  of  MMSE  equalization, 
we  project  the  Nr  x  1  received  vector  onto  the  Nt 
dimensional  signal  space  as  Nr  might  be  much  larger  than 
Nt.  (This  step  is  optional,  but  can  reduce  the  matrix  size 
for  the  subsequent  iterative  MIMO  equalization  without 
compromising  the  performance.)  Specifically,  let  U[fc] 
contain  Nt  basis  vectors  of  the  range  space  of  H[fc], 
which  can  be  found  by  singular  value  decomposition.  We 


obtain 

z[fc]  =  XJH[k}y[k]  =  A[fc]s[fc]  +  £[k],  (5) 

where  A  [k]  =  Uw[fc]H[fc]  is  now  a  square  matrix  and 
£[fc]  =  Uw[A:]w[A:]  has  the  same  covariance  as  w  [k). 

•  Step  2:  Interference  cancellation. 

The  data  streams  which  are  declared  with  decoding 
success  do  not  need  to  be  decoded  again.  Hence,  their 
contributions  can  be  subtracted  from  the  received  signals. 
Assume  that  N  out  of  Nt  data  streams  remain  to  be 
decoded.  Partition  A [k]  as  [Ad[fc],  Au[fc]],  where  the  first 
part  corresponds  to  the  correctly  decoded  data  streams 
and  the  second  part  corresponds  to  the  remaining  streams. 
Similar  partition  is  performed  for  s<j[fc]  and  su  [/::].  We 
then  obtain 

z [k]  =  z[k]  -  Ad[A:]sd[fc]  =  Au[fc]su[fc]  +  £[k\.  (6) 

•  Step  3:  MMSE  equalization  with  a  priori  information. 
On  each  subcarrier  k,  the  MMSE  equalization  algorithm 
with  a  priori  information  from  [17]  is  applied.  The  inputs 
to  the  MMSE  equalizer  are  z[k\,  Au[/c],  and  the  means 
and  variances  of  all  symbols  comprising  su  [k] .  The  out¬ 
puts  of  the  MMSE  equalizer  are  the  probabilities  of  each 
information  symbol  being  equal  to  one  valid  constellation 
point.  The  details  are  provided  in  the  Appendix. 

•  Step  4:  Nonbinary  LDPC  decoding. 

With  the  outputs  from  the  MMSE  equalizer,  non-binary 
LDPC  decoding  [18]  is  run  for  each  data  stream  to  be 
decoded.  The  decoder  yields  the  decoded  information 
symbols  and  the  updated  probabilities,  which  are  used 
to  refine  the  mean  and  variance  of  each  symbol  as 

s=  Y  Ui'  P(s  =  ctj),  (7) 

Q!iG«S 

v  =  (  Y  \^\2  ■  P(s  =  ai))  -|s|2,  (8) 

a.eS 

where  S  =  {«i,  012, . . . ,  %}  denotes  the  M- ary  modu¬ 
lation  alphabet. 

During  the  decoding  process,  the  decoder  will  declare 
success  if  the  parity  check  conditions  are  satisfied. 

•  Step  5:  Iteration  among  steps  2,  3,  and  4. 

The  iteration  will  stop  after  one  more  round  of  decoding 
on  the  last  data  stream  when  the  other  Nt  —  1  streams 
have  been  successfully  decoded,  or  after  a  pre-specified 
number  of  runs. 
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IV.  Performance  Results:  AUV07 

The  experimental  data  for  this  test  was  collected  during  the 
AUV  Fest  held  in  Panama  City,  FL,  June  2007.  The  water 
depth  was  20  meters.  Two  transmitters  were  deployed  about  9 
meters  below  a  surface  buoy.  The  receiving  array  was  about 
9  meters  below  a  boat.  The  vertical  array  was  2  meters  in 
aperture  with  16  hydrophones,  out  of  which  we  used  four. 
Here  we  report  performance  results  for  transmission  distances 
of  500  and  1500  meters.  The  key  system  parameters  are  listed 
in  Table  I. 

TABLE  I 

System  Parameters  for  AUV07 


t  in  ms 


Sampling  rate 

fs  =  96  kHz 

Center  frequency 

fc  =  32  kHz 

Signal  bandwidth 

B  =  12  kHz 

OFDM  block  duration 

T  =  85.33  ms 

Guard  interval 

Tg  =  25  ms 

Subcarrier  spacing 

A  /  =  11.72  Hz 

Number  of  subcarriers 

K  =  1024 

Number  of  data  carriers 

Kd  =  672 

Number  of  pilot  carriers 

Kp  =  K/4  =  256 

Number  of  null  subcarriers 

Kn  =  96 

Spectral  efficiency  (two  transmitters, 
QPSK  modulation,  rate  1/2  coding) 

a  =  1.015  bits/s/Hz 

Data  rate 

R  =  12.18  kb/s 

A.  Channel  profiles  via  preamble  correlation 

A  linearly-frequency-modulated  (LFM)  signal  is  used  as 
a  preamble  for  synchronization.  The  correlation  results  are 
shown  in  Fig.  2  for  the  500  m  case,  and  in  Fig.  3  for  the 
1500  m  case.  It  can  be  seen  that  the  channel  at  500  m  has  a 
larger  delay  spread  than  the  channel  at  1500  m,  as  expected. 


B.  CFO  and  channel  estimation 

The  CFO  estimates  are  shown  in  Fig.  4  for  one  data  packet 
on  one  receiver.  The  CFO  is  within  [-2,  2]  Hz  range,  which 
is  caused  by  transmitter  and  receiver  drifting  with  waves. 

The  estimated  channel  for  one  OFDM  block  is  shown  in 
Fig.  5,  which  is  in  good  agreement  with  the  channel  profiles 
shown  in  Figs.  2  and  3.  It  can  be  seen  that  the  channel  for  the 
500  m  case  has  larger  energy. 

With  Kp/2  =  128  subcarriers  for  each  channel  estimation, 
we  can  estimate  128  channel  taps  in  discrete  time,  which 
amounts  to  a  delay  spread  of  10.7  ms.  Any  arrivals  after  10.7 
ms  will  thus  be  treated  as  additive  noise.  Since  the  channel 
at  500  m  has  significant  arrivals  after  10.7  ms,  the  noise  floor 
is  much  higher  (around  8  dB)  than  that  at  1500  m,  as  shown 
in  Fig.  6.  As  a  result,  although  the  signal  energy  at  500  m 
is  greater  than  that  at  1500  m,  the  pre-demodulation  signal  to 
noise  ratios  (SNRs)  become  similar  for  both  cases.  The  pre¬ 
demodulation  SNR  is  computed  as  the  ratio  of  the  average 
signal  energy  on  the  pilot  subcarriers  to  the  average  energy 
on  the  null  subcarriers. 


Fig.  2.  Channel  profile  based  on  preamble  correlation,  500  m,  AUV07. 


Fig.  3.  Channel  profile  based  on  preamble  correlation,  1500  m,  AUV07. 


Fig.  4.  Doppler  estimates  for  one  packet  of  64  OFDM  blocks  at  receiver 
1;  this  Doppler  shift  is  due  to  unintentional  drifting. 


C.  BER  results 

We  now  report  on  the  BER  results,  where  QPSK  modulation 
and  rate  1/2  nonbinary  LDPC  coding  are  utilized  and  the 
data  rate  is  12.18  kb/s.  A  total  of  64  OFDM  blocks  have 
been  transmitted,  and  hence,  each  parallel  data  stream  contains 
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Fig.  6.  Average  noise  variance  on  null  subcarriers  for  one  packet. 
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Fig.  7.  The  500  m  case  in  AUV07,  data  stream  1,  MMSE  equalization  Fig.  8.  The  500  m  case  in  AUV07,  data  stream  2,  MMSE  equalization 
followed  by  LDPC  coding;  no  block  has  decoding  errors.  followed  by  LDPC  coding;  2  out  of  64  blocks  have  decoding  errors. 


672  x  64  =  43008  information  bits.  Figs.  7  and  8  show  the 
uncoded  and  coded  BERs  for  data  streams  1  and  2  in  the  500m 
case,  respectively,  where  MMSE  equalization  is  followed  by 
LDPC  decoding  but  with  no  iteration.  Only  two  out  of  64 
blocks  have  decoding  errors  for  data  stream  2.  For  the  1500 
m  case,  there  is  no  error  after  LDPC  decoding  with  the  non¬ 
iterative  receiver. 

We  then  apply  the  iterative  MMSE  demodulation  and  chan¬ 
nel  decoding  on  the  data  of  the  500  m  case.  There  is  no 
decoding  error  after  one  round  of  iteration. 

V.  Performance  Results:  RACE08 

The  Rescheduled  Acoustic  Communications  Experiment 
(RACE)  was  held  in  the  Narragansett  Bay,  Rhode  Island, 
March  2008.  The  water  depths  in  the  area  range  from  9  to 
about  14  meters.  The  primary  source  of  an  ITC1007  transducer 
for  acoustic  transmissions  was  located  approximately  4  meters 
above  the  bottom.  A  vertical  source  array  consisting  of  three 
AT-12ET  transducers  with  a  spacing  of  60  cm  between  each 
transducer  was  deployed  below  the  primary  source.  The  top  of 
the  source  array  was  approximately  1  meter  below  the  primary 
source.  The  system  parameters  are  listed  in  Table  II. 


TABLE  II 

The  System  Parameters  for  the  RACE08  Experiment 


Sampling  rate 

fs  =  39.0625  kHz 

Center  frequency 

fc  =  11.5  kHz 

Signal  bandwidth 

B  =  4.8828  kHz 

OFDM  block  duration 

T  =  209.7152  ms 

Guard  interval 

Tg  =  25  ms 

Subcarrier  spacing 

A /  =  4.8  Hz 

Number  of  subcarriers 

K  =  1024 

Number  of  data  carriers 

Kd  =  672 

Number  of  pilot  carriers 

Kp  =  K/ 4  =  256 

Number  of  null  subcarriers 

Kn  =  96 

The  four  transducers  are  labeled  from  top  to  bottom  as  TO, 
Tl,  T2  and  T3.  For  MIMO-OFDM  transmissions,  TO  and  T1 
were  used  for  two  transmitters,  T0-T2  for  three  transmitters, 
and  T0-T3  for  four  transmitters.  TO  and  the  T1-T3  array 
were  driven  by  different  power  supplies  and  hence  they  have 
different  front-end  circuits.  In  addition,  driven  by  the  same 
voltage  inputs,  the  transducer  TO  produces  less  transmission 
power  than  T1-T3,  about  5dB  lower  comparing  the  peaks. 
Finally,  the  spacing  between  TO  and  Tl  is  greater  than  the 
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QPSK  8QAM  16QAM  64QAM 


f - 4  packets  per  data  burst - )| 

Fig.  9.  The  structure  of  the  transmission  file  for  RACE08. 

spacings  between  Tl,  T2,  T3.  Such  a  disparity  between  TO 
and  T1-T3  renders  the  data  stream  from  TO  at  a  disadvantage 
relative  to  the  other  data  streams;  this  will  be  reflected  by  the 
performance  results. 

For  each  MIMO-OFDM  configuration,  one  data  burst  con¬ 
sisted  of  four  packets  with  different  modulation  formats,  as 
shown  in  Fig.  9.  In  particular,  the  packet  of  QPSK  mod¬ 
ulation  contains  36  OFDM  blocks,  the  packet  of  8-QAM 
contains  24  OFDM  blocks,  the  packet  of  16QAM  contains 
18  OFDM  blocks,  while  the  packet  of  64-QAM  contains  12 
OFDM  blocks.  (The  8-QAM  constellation  used  in  this  paper 
is  from  [22,  Fig.  4.3-4]).  Rate  1/2  nonbinary  LDPC  coding  as 
described  in  [18]  is  applied.  Hence,  each  data  burst  contains 
the  same  number  (672  x  36  =  24192)  of  information  bits  for 
each  parallel  data  stream  at  each  setting. 

Three  receiving  arrays  were  deployed  during  the  experi¬ 
ment,  mounted  on  fixed  tripods  with  the  bottom  of  the  arrays 
2  meters  above  the  sea  floor.  We  here  report  on  the  results  for 
the  array  at  400  meters  to  the  east  from  the  source,  which  is  a 
24-element  vertical  array  with  5  cm  between  elements.  (Note 
that  half  of  the  wavelength  at  the  carrier  frequency  is  about 
6.5  cm.  The  responses  on  the  array  elements  might  be  slightly 
correlated.)  We  will  use  the  data  from  the  top  12  elements 
for  processing,  where  the  iterative  MMSE  demodulation  and 
decoding  structure  is  employed. 

During  the  experiment,  each  signal  was  transmitted  twice 
every  four  hours,  leading  to  12  transmissions  each  day.  We 
here  report  on  the  performance  results  based  on  data  collected 
from  28  transmissions  within  the  Julian  dates  081-083  (March 
21-23,  2008).  Hence,  each  data  stream  at  each  setting  has  a 
total  of  28  x  24192  =  677,  376  information  bits  transmitted. 

Fig.  10  depicts  the  channel  estimates  in  one  OFDM  block 
with  three  transmitters,  while  Fig.  11  shows  the  channel 
estimates  in  one  OFDM  block  with  four  transmitters  (from 
a  recorded  block  at  the  time  0200  GMT  on  the  Julian  date 
081).  The  channel  delay  spreads  are  about  5  ms.  Note  that  the 
channel  corresponding  to  the  first  data  stream  (transducer  TO) 
has  lower  energy  than  others.  This  is  a  general  trend  for  all 
the  received  blocks,  and  is  attributed  to  the  implementation 
differences  discussed  earlier. 

A.  Performance  results  with  two  transmitters 

Figs.  12-14  depict  the  coded  block-error-rate  (BLER)  for 
each  received  data  set  across  the  Julian  dates  081-083.  The 
8-QAM  case  is  omitted  as  it  has  zero  BLERs  across  all  dates. 
Decoding  errors  occur  only  in  one  out  of  28  data  sets  in 
the  QPSK  case,  where  two  out  of  36  OFDM  blocks  were 
badly  distorted  thus  preventing  correct  decoding  of  stream 
1.  Table  III  summarizes  the  coded  bit-error-rates  (BERs)  and 
BLERs  averaged  over  all  data  sets;  i.e.,  a  total  of  677,376 
information  bits  was  used  for  each  BER  computed  in  the  table. 


Fig.  10.  Channel  estimates  from  one  OFDM  block  with  three 
transmitters,  RACE08. 


Fig.  1 1 .  Channel  estimates  from  one  OFDM  block  with  four  trans¬ 
mitters,  RACE08. 

B.  Performance  results  with  three  transmitters 

Figs.  15,  16,  and  17  depict  the  coded  BLER  for  each 
received  data  set  across  the  Julian  dates  081-083.  Table  IV 
summarizes  the  BERs  and  BLERs  averaged  over  all  data  sets. 

C.  Performance  results  with  four  transmitters 

Figs.  18  and  19  depict  the  coded  BLER  for  each  received 
data  set  across  the  Julian  dates  081-083.  Table  V  summarizes 
the  BERs  and  BLERs  averaged  over  all  data  sets. 

Usually  no  more  than  five  iterations  were  needed  between 
MIMO  demodulation  and  decoding.  From  Tables  III-V,  we 
observe  that  the  data  steam  1  has  worse  performance  than 
the  other  data  streams.  This  is  partly  due  to  the  fact  that  the 
transducer  on  TO  has  less  power  efficiency  than  others.  We 
also  conjecture  that  there  might  exists  a  possible  Doppler  shift 
mismatch  between  TO  and  the  array  T1-T3  due  to  different 
spacings  and  front-end  circuits.  The  BLER  performance  for  all 
other  data  streams  except  stream  1  are  acceptable  and  actually 
very  good  in  many  cases.  A  closer  look  at  Figs.  15-19  reveals 
that  no  errors  occurred  in  the  majority  of  the  data  sets  within 
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Fig.  12.  Block  error  rates  for  2IMO-OFDM.  QPSK 


Fig.  13.  Block  error  rates  for  2IMO-OFDM,  16-QAM 
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Fig.  14.  Block  error  rates  for  2IMO-OFDM,  64-QAM 


Fig.  15.  Block  eiTor  rates  for  3IMO-OFDM,  QPSK 
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Fig.  16.  Block  error  rates  for  3IMO-OFDM.  8-QAM 


Fig.  17.  Block  error  rates  for  3IMO-OFDM,  16-QAM 
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Fig.  18.  Block  error  rates  for  4IMO-OFDM,  QPSK 


Fig.  19.  Block  eiTor  rates  for  4IMO-OFDM,  8-QAM 


the  three-day  span.  The  particular  case  of  two  transmitters 
and  8-QAM  modulation  having  a  spectral  efficiency  of  1.76 
bits/s/Hz  does  not  have  any  decoding  error  across  all  the  28 
data  sets  across  three  days. 

In  short,  the  spectral  efficiency  can  be  increased  consid¬ 
erably  by  using  high  order  modulation  in  MIMO-OFDM 
transmissions,  as  demonstrated  by  the  values  corresponding 
to  different  configurations  in  Tables  III-V.  In  particular,  a 


spectral  efficiency  of  3.52  bits/sec/Hz  is  approached  by  two 
parallel  64-QAM  data  streams,  or  three  parallel  16-QAM  data 
streams,  or  four  parallel  8-QAM  data  streams1. 

'Note  that  some  of  the  high  spectral  efficiencies  are  obtained  at  relative 
high  block  error  rates.  In  practice,  a  high-rate  outer  channel  code  could  handle 
those  block  errors  at  a  small  rate  loss.  We  thank  one  reviewer  for  pointing 
this  out. 
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TABLE  HI 

Performance  Results  with  Two  Transmitters  and  Twelve  Receivers,  RACE08 


Spectral  efficiency 

Data  streams 

Average  BER 

Average  BLER 

2IMO,  QPSK 

1.17  bits/s/Hz 

Stream  1 
Stream  2 

4  ■  icr4 

0 

2  MO-'3 

0 

2IMO,  8-QAM 

1.76  bits/s/Hz 

Stream  1 
Stream  2 

0 

0 

0 

0 

2IMO,  16-QAM 

2.35  bits/s/Hz 

Stream  1 
Stream  2 

cTTcF3 

3  ■  10-3 

3  ■  io~2 

1  ■  10“2 

2IMO,  64-QAM 

3.52  bits/s/Hz 

Stream  1 
Stream  2 

7- 10“4 

i  ■  icr2 

4  ■  icr1 

5  ■  icr2 

TABLE  IV 

Performance  Results  with  Three  Transmitters  and  Twelve  Receivers,  RACE08 


Spectral  efficiency 

Data  streams 

Average  BER 

Average  BLER 

3IMO,  QPSK 

1.76  bits/s/Hz 

Stream  1 
Stream  2 
Stream  3 

2  ■  10~2 

0 

5  ■  to-4 

1  rcr1 

0 

2  ■  icr3 

3IMO,  8-QAM 

2.64  bits/s/Hz 

Stream  1 
Stream  2 
Stream  3 

np 

l  ■  icr3 

8  ■  icr4 

5  ■  io-2 

6  ■  icr3 

3  ■  icr3 

3IMO,  16-QAM 

3.52  bits/s/Hz 

Stream  1 
Stream  2 
Stream  3 

4  ■  icr2 

1  ■  icr2 

2  ■  icr2 

i.6  •  icr1 

4  ■  icr2 

8  ■  icr2 

TABLE  V 

Performance  Results  with  Four  Transmitters  and  Twelve  Receivers,  RACE08 


Spectral  efficiency 

Data  streams 

Average  BER 

Average  BLER 

4IMO,  QPSK 

2.35  bits/s/Hz 

Stream  1 

00 

o 

1 

r 

N 

1 

o 

F- 

Stream  2 

2  ■  icr3 

4  ■  icr2 

Stream  3 

3  ■  icr3 

3  ■  icr2 

Stream  4 

CO 

O 

1 

CO 

3-  icr2 

4IMO,  8-QAM 

3.52  bits/s/Hz 

Stream  1 

4  ■  icr1 

2  ■  icr1 

Stream  2 

CO 

1 

o 

F- 

i  ■  icr1 

Stream  3 

i  •  icr2 

CN 

1 

O 

00 

Stream  4 

00 

O 

1 

CO 

CN 

1 

o 

00 

VI.  Performance  Results:  VHF08 

This  experiment  was  conducted  in  the  Buzzards  Bay,  MA, 
April  2008.  The  water  depth  was  12  meters.  Two  transmitters 
were  about  6  meters  below  a  surface  buoy.  The  receiving  array 
was  about  6  meters  below  a  boat.  The  array  was  1  meter  in 
aperture  with  6  hydrophones.  The  transmission  distance  was 
450  meters  with  a  very  high  frequency  (VHF)  signal  used.  We 
scale  the  basic  design  of  the  K  =  1024  case  for  the  AUV07 
experiment  to  two  different  bandwidths:  B  =  31.25  kHz  and 
B  =  62.5  kHz,  following  the  design  rules  outlined  in  [23]. 
The  parameters  used  are  listed  in  Table  VI. 

TABLE  VI 

The  System  Parameters  for  the  VHF08  Experiment 


Sampling  rate 

fs  =  500  kHz 

Center  frequency 

fa  =  110  kHz 

OFDM  block  duration 

T  =  65.5  ms 

Guard  interval 

Tg  =  20  ms 

Subcarrier  spacing 

A /  =  15.2588  Hz 

Signal  bandwidth 

B  =  31.25  kHz 

B  =  62.5  kHz 

Number  of  subcarriers 

K  =  2048 

K  =  4096 

Number  of  data  carriers 

Kd  =  1344 

Kd  =  2688 

Number  of  pilot  carriers 

Kp  =  512 

Kp  =  1024 

Number  of  null  subcarriers 

Kn  =  198 

Kn  =  396 

A.  Channel  estimation 

0.01 

0.009 

0.008 

0.007 

0.006 
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Fig.  20.  Estimated  channel  for  one  OFDM  block  on  one  receiver,  VHF08. 

The  estimated  channel  for  one  OFDM  block  is  shown  in 
Fig.  20.  The  delay  spread  is  about  4  ms. 

B.  BER  results 


The  BER  results  for  different  settings  are  listed  in  Table 
VII.  Two  transmitters  and  six  receivers  were  used.  The  results 
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TABLE  VII 

Performance  results  for  the  VHF08  experiment.  Two  transmitters,  rate  1/2  coding. 


Spectral  efficiency 

Data  rate 

Data  Streams 

Uncoded  BER 

Coded  BER 

B  =  31.25kHz,  QPSK 

1.0055  bits/s/Hz 

31.4214  kb/s 

Stream  1 

0.0025 

0 

Stream  2 

0 

0 

B  =  31.25kHz.  8-QAM 

1.5082  bits/s/Hz 

47.1320  kb/s 

Stream  1 

0.0378 

0 

Stream  2 

0.0049 

0 

B  =  31.25kHz,  16-QAM 

2.0010  bits/s/Hz 

62.8438  kb/s 

Stream  1 

0.0868 

0 

Stream  2 

0.0319 

0 

B  =  62.5kHz,  QPSK 

1.0055  bits/s/Hz 

62.8427  kb/s 

Stream  1 

0.0512 

0 

Stream  2 

0.0193 

0 

B  =  62.5kHz,  8-QAM 

1.5082  bits/s/Hz 

94.2640  kb/s 

Stream  1 

0.1102 

0 

Stream  2 

0.0488 

0 

B  =  62.5kHz,  16-QAM 

2.0110  bits/s/Hz 

125.6875  kb/s 

Stream  1 

0.1938 

0 

Stream  2 

0.1290 

0 

are  based  on  the  iterative  receiver  with  rate  1/2  nonbinary 
LDPC  coding.  There  were  36,  24,  and  18  OFDM  blocks  for 
the  cases  of  QPSK,  8-QAM,  and  16-QAM,  respectively,  when 
B  =  31.25  kHz.  There  were  18,  12,  and  9  OFDM  blocks  for 
the  cases  of  QPSK,  8-QAM,  and  16-QAM,  respectively,  when 
B  =  62.5  kHz.  Therefore,  the  BER  values  in  Table  VII  are 
averaged  over  1344  x  36  =  48,  384  information  bits  for  each 
parallel  data  stream  at  each  setting.  Error-free  performance  is 
achieved  in  this  data  set  after  no  more  than  two  rounds  of 
iterative  demodulation  and  decoding. 


VII.  Conclusions 

In  this  paper,  a  MIMO-OFDM  system  with  spatial  multi¬ 
plexing  was  presented.  The  receiver  works  on  a  block-by-block 
basis  where  null  and  pilot  subcarriers  are  used  for  Doppler  and 
channel  estimation,  respectively,  and  an  iterative  structure  is 
used  for  MIMO  detection  and  decoding.  We  reported  on  the 
performance  results  based  on  data  processing  from  three  dif¬ 
ferent  experiments,  showing  very  high  spectral  efficiency  via 
parallel  data  multiplexing  with  high  order  constellations.  These 
example  results  suggest  that  MIMO-OFDM  is  an  appealing 
choice  for  high  data  rate  underwater  acoustic  communications. 

Further  investigations  on  MIMO  underwater  acoustic  com¬ 
munications,  both  single-  and  multi-carrier  approaches,  are 
warranted,  especially  on  the  capacity  limits  in  underwater 
channels,  advanced  receiver  designs,  and  experimental  results 
in  more  challenging  channel  conditions  with  large  Doppler 
spread. 


Appendix:  MMSE  Equalization  with  a  priori 
Information  [17] 

For  convenience,  we  list  here  the  MMSE  equalization 
algorithm  with  a  priori  information  from  [17]. 

We  omit  the  index  k  and  the  subscript  in  (6)  to  work  on  a 
generic  model  z  =  As  +  £,  where  £  has  a  covariance  matrix 
<j\, Ijv-  The  a  priori  information  of  sn,  n  =  1,2 is 
given  in  the  forms  of  the  mean  sn  =  E(sn)  and  the  variance 
vn  =  Co v(sn,sn).  Let  a„  denote  the  n-th  column  of  matrix 


A,  and  use  sn  and  vn  to  define: 


S  =  E(s)  =  [si,S2,...,SA,]r, 

(9) 

z  =  E(z)  =  AE(s)  =  As, 

(10) 

V  =  Cov(s,  s)  =  diag[ui,U2, . . .  ,vN], 

(11) 

S  =  Cov(z,  z)  =  o-IIn  +  AVAh, 

(12) 

f„  =  S_1a„, 

(13) 

Kn  ~  (1  +  (1  -  Gr)f,7an)_1- 

(14) 

The  estimate  sn  is  then  computed  as 

Sn  =  Kn  ■  f^(z  -  Z  +  Sna„). 

(15) 

In  this  computation,  sn  is  independent  from  the  a  priori  in¬ 
formation  about  sn,  but  dependent  on  the  a  priori  information 
about  all  sn'  where  n'  ^  n  [17], 

Assuming  that  sn  is  Gaussian  distributed  with  mean 

f^n  —  Kn  Sp  '  f",;  a„  (16) 

and  variance 

°n  =  Kn  ■  (Can  ~  U„f”ana^fn),  (17) 

the  probabilities  p(sn\sn  =  a,;),  i  =  1,2, . . . ,  M,  can  be 
computed  from  Gaussian  probability  density  function  [17]. 
These  probabilities  are  passed  to  the  nonbinary  LDPC  decoder. 
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